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Abstract 

\^ . The notion of Abelian complexity of infinite words was recently used by 

the three last authors to investigate various Abelian properties of words. 
In particular, using van der Waerden's theorem, they proved that if a word 
Cy ■ avoids Abelian fc-powers for some integer k, then its Abelian complexity 

is unbounded. This suggests the following question: How frequently do 
Abelian fc-powers occur in a word having bounded Abelian complexity? In 
particular, does every uniformly recurrent word having bounded Abelian 
complexity begin in an Abelian fc-power? While this is true for various 
classes of uniformly recurrent words, including for example the class of 
all Sturmian words, in this paper we show the existence of uniformly 

J/-V , recurrent binary words, having bounded Abelian complexity, which admit 

*^ ■ an infinite number of suffixes which do not begin in an Abelian square. 

We also show that the shift orbit closure of any infinite binary overlap- free 
word contains a word which avoids Abelian cubes in the beginning. We 
also consider the effect of morphisms on Abelian complexity and show 
that the morphic image of a word having bounded Abelian complexity 
has bounded Abelian complexity. Finally, we give an open problem on 
avoidability of Abelian squares in infinite binary words and show that 

ky( \ it is equivalent to a well-known open problem of Pirillo-Varricchio and 

-^ ■ Halbeisen-Hungerbiihler. 
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1 Introduction 

The notion of Abelian complexity of an infinite word was recently developed by 
the three last authors [12 to study various Abelian properties of words. This 
gave way to several other interesting results: While aperiodic balanced binary 
(i.e., Sturmian) and ternary words are examples of recurrent words with constant 
Abelian complexity 2 and 3, respectively (see [T^), Currie and Rampersad [3] 
showed that there are no recurrent words with constant Abelian complexity 
fc > 4. On the other hand, Saarela [14] showed that for any k > 2 there exist 
recurrent infinite words whose Abelian complexity is ultimately constant k. 

It is natural to try to determine the Abelian complexity of some well-known 
infinite words. For Sturmian words, this was essentially done already by Morse 
and Hedlund [10; by showing that these words are balanced. Characterizing 
the Abelian complexity of the Tribonacci word is substantially harder, and this 
was only partially achieved by the three last authors [13]. At any rate, the 
Abelian complexity of the Tribonacci word is bounded by 7. A similar result 
for a different class of ternary words was obtained very recently by Turek }18) . 

Before proceeding any further, let us quickly recall some basic terminology. 
Let A he a finite alphabet. For each word x G A* , we denote the length of 
X by I a: I and the number of occurrences of a letter b € A in x hy \x\b. If 
A — {fli, 02, . . . , flfe}, the Parikh vector of x is ^(x) :— {\x\ai , • ■ • , la^Ufc)- If two 
words x^y £ A* have the same Parikh vector, then we say that they are Abelian 
equivalent^ and we express this relation by writing x ^ab V- The empty word is 
denoted by e. 

An infinite word is an expression x = 00010203 • • • , where Oi € A; occasion- 
ally we index infinite words starting from 1 instead of 0. An infinite word of the 
form o„o„_|_io„+2 ■ ' ■ j where n > 1, is called a suffix of x; 

A finite word a; is a subword of x if we have x — o„o„+i • • • o„+|a.|_i for some 
integer n > 1. An infinite word x is called C -balanced, for some C > 0, if all 
subwords x, y of x with |x| — \y\ satisfy | \x\b — \y\b\ < C for aU letters b £ A. If 
C = 1, then we simply say that x is balanced. 

The subword complexity of x is the mapping px: N — > N for which Pxin) is 
the number of distinct subwords of length n. The Abelian complexity of x is the 
mapping p'^ : N — >■ N for which p'^{n) is the number of subwords of length n 
that are pairwise Abelian inequivalent. In [12j, the three last authors showed 
that Abelian complexity is closely linked to the notion of balance: 

Lemma 1 ([12]). An infinite word has bounded Abelian complexity if and only 
if it is C-balanced for some C > 0. 

A nonempty word w is an Abelian k-power, where fc > 2 is an integer, if 
it can be written in the form w = U1U2 ■ ■ ■ Uk with the Ui pairwise Abelian 
equivalent. Then we also say that w has (Abelian) period |mi|. According to a 
classical result, an infinite word is ultimately periodic if and only if its subword 
complexity is bounded. The next result, which relies on the well-known van der 
Waerden's theorem, may be considered as a partial Abelian analogue of this. 



Theorem 2 ([I2|)- If an infinite word has bounded AbeUan complexity, then 
it contains Abehan /c-powers for all integers fc > 1. 

This result naturally gives rise to the following question: 

Does every uniformly recurrent word with bounded Ahelian complex- 
ity begin in an Ahelian k-power for every k? 

It turns out that the Thue-Morse word, or more generally the infinite binary 
overlap-free words, nicely shed light to this question. Recall that the Thue- 
Morse word TMq = totit2 • • • is the fixed point of the morphism /i: {0, 1}* — >■ 
{0, 1}* starting with the letter 0; the binary overlap-free words are characterized 
by the absence of subwords of the form axaxa, where a £ {0,1}*. As is well- 
known, the letter i„ is given by the number of bits 1 in the binary expansion of 
n modulo 2. The Abelian complexity of the Thue-Morse word is obtained as a 
corollary of the following result. 

Theorem 3 ([H])- An aperiodic infinite word x has Abelian complexity 

I 3 if n is even 

if and only if x is of the form 0/z(y), l/i(y), or /i(y), where fx is the Thue-Morse 
morphism and y is any infinite binary word. 

In particular, the Thue-Morse word has a bounded Abelian complexity, and 
as a fixed point of a primitive morphism, it is also uniformly recurrent. In 
Section [2] we show that while every suffix of the Thue-Morse word begins in 
an Abelian A;-power, for k > 3, there is no upper bound for the length of the 
shortest such powers. It follows that there exist uniformly recurrent overlap- 
free binary words with bounded Abelian complexity which do not begin in an 
Abelian cube (i.e., an Abelian 3-power). So this already answers our original 
question, and incidentally, along the way it establishes a conjecture made by 
the second author in his PhD thesis [TS] namely that the Thue-Morse word is 
so-called everywhere Abelian 2-repetitive, but not 3-repetitive. 

Avoiding Abelian cubes in the beginning, however, is not the best possible 
answer. In Section[3l as a prelude to the remaining sections, we give an example 
of a uniformly recurrent binary word with bounded Abelian complexity that 
avoids Abelian squares (i.e., 2-powers) in the beginning. We then study two 
questions on the relation between morphisms and infinite words with bounded 
Abelian complexity. In Section |4l we characterize the morphisms which, like 
the Thue-Morse morphism, map any infinite word to a word with bounded 
Abelian complexity, and in Section [SJ we show that although the image under 
a morphism of a word can have unbounded Abelian complexity, all morphisms 
preserve bounded Abelian complexity. 

Then in Section [6] we construct an example of a uniformly recurrent binary 
word with bounded Abelian complexity avoiding Abelian squares in infinitely 
many positions. Lastly, in Section [71 we explain why this result is optimal 



and conclude with an open problem showing that a slight variation of this is 
equivalent to an older one raised independently by Pirillo-Varricchio [Tl] and 
Halbeisen-Hungerbiihler |S] . 

Let us finally remark that some other questions on the avoidance of Abelian 
powers in binary words have been considered before: Entringer, Jackson, and 
Schatz [7] showed that every infinite binary word contains arbitrarily long 
Abelian squares, and Dekking [4] constructed an infinite binary word that avoids 
Abelian 4-powers. It is not known whether there exist an infinite binary word 
avoiding long Abelian cubes (see Section 2.9 in [TB]V 

2 Abelian cubes and overlap-free words 

In this section, we let [a;] 2 denote the binary expansion of a positive integer x 
without leading zeros. For example, [2"] 2 — 10" for all integers n > 0. 

Theorem 4. Every sufhx of the Thue-Morse word TMq begins in an Abelian 
fc-power for all positive integers fc. 

Proof. Let us denote TMq — io^ii2 • ■ ■ ■ We first observe that there exists a 
positive integer m such that 

to — tm — t2m — ■ ■ ■ — t(fe+l)m (= 0). 

For example, we may take m = 2L'°S2(*=+i)J+i + i. This follows from the fact 
that the number of Is in the expansion [i ■ rn\2 is twice the number of Is in the 
expansion [i]2 

Another observation we need is that if x and y are any binary words of the 
same length, then the words ^.{x) and ^{y) are Abelian equivalent. 

Now let X be a sufhx of TMq, so that we have TMq = px for some p. 
Choose an integer n > 1 such that |/i"(0)| > \p\. Since /i"(0) is a prefix of 
TMq, there exists a word s such that ^'"(0) = ps. By the observation we made 
in the beginning of the proof, there exist words xi, 2:2, . . . , Xfc of the same length 
such that the word 

k 

(J|0xj)0 = 02;i0a;20---0xfc0 

1=1 

is a prefix of TMq. But then so is the word 

/x"(0xi0x20 • • • OxfcO) = psfi"" {xi)psij"' {x2)ps ■ ■ ■ psii"'{xk)ps. 

Therefore the suffix x begins in an Abelian fc-power 

sn'^{xi)p.sn'\x2)p sn'\xk)p- 

D 

Now we prove the existence of a binary overlap-free word that docs not begin 
in an Abelian cube (see Corollary El) . 



Lemma 5. Let x and z = 20^:12:2 • • • be infinite words such that x = /i(z), and 
let n > be an integer. If the position 2n + 1 in x has an occurrence of an 
Abehan 3-power with period i, then £ — 2k for some k and we have 

Proof. Let w denote an Abelian 3-power starting in position 2n + 1 in x, and 
let £ denote its period. First we show that £ is an even integer. To this end, 
suppose that £ is odd. Then since w occurs in an odd index, it is of the form 
w — aii{x).fj,{y)b.cii{z), where a,b,c G {0,1} and the words aii{x), fJ-{y)b, and 
c/x(z) are Abelian equivalent. Since each of the words x, y, and z are of the 
same length, their images fi{x), ii{y), and fj,{z) are trivially Abelian equivalent. 
Therefore we have a = b = c. But this is not possible because be is an image of 
a letter so that be € {01, 10}. 

Thus we may suppose that £ = 2k for some positive integer k. Then we have 

w — aii{x)b.eii{y)d.efi{z) f 

for some letters a,b,e,d,e, and / with be,de G {01,10}. Furthermore, each 
of the words afj,{x)b, efj,{y)d, and efi{z)f is of length 2fc, and they are pairwise 
Abelian equivalent. Now we have a ^ b; for otherwise the fact that b ^ e implies 
that the words afi{x)b and e^{y)d are not Abelian equivalent. Similarly we see 
that e ^ d and 67^/, and thus we conclude that either 

w = Qn{x)l.Q^l{y)l.Qn{z)l or w ^ l^i{x){).l^l{y)0.l^l{z)0 

In the first case, the occurrence of w extends to the left by the letter 1, so that 
the word Iw occurs at position 2n. It follows that the word Ixlyl occurs at 
position n. Since \lx\ = \ly\ = k, we therefore have z„ — Zn+k — Zn+2k = 1- In 
the second case we deduce similarly that z„ = Zn+k ~ Zn+2k = 0. D 

Lemma 6. Let n and k be integers with n > 1 and < fc < 2". Then there 
exists a word u E {0, 1}" such that 

[2"+i + fc]2== 10m and [2"+^ + 2fc + l]^ = Iwl. 

Proof. Since < A: < 2" — 1, we have 

2«+i < 2"+i + k< 2"+i + (2" - 1); 

in other words 

10"+1<1,, [2"+l+fc]2<lexl01", 

where <iox denotes the lexicographic order. Therefore [2"+^ + ^]2 ~ ^^^ ^'-'-'^ 
some u G {0, 1}". Thus we have 

[2"+i + fc] 2 = 10" implying 

[2(2"+i + fc)] 2 = 10"0 implying 

[2(2"+i + k) - 2"+i] 2 = luO implying 

[2(2"+i + k) - 2"+i + 1] 2 = 1"1 implying 

[2"+i + 2fc+l]2 = lMl. 



n 

Lemma 7. Let n > 1 be an integer. If an Abelian 3-power occurs in TMq at 
the position 2"+^ — 1, then its period is at least 2"+^. 

Proof. Suppose that w is an Abelian 3-power with period £ < 2""*"^ occurring 
at position 2"+^ — 1. Then by Lemma El we have that £ = 2k for some positive 
integer k, and 

Thus in particular the number of Is occurring in the expression [2"+^ — 1 + 
fc] has the same parity as the number of Is occurring in [2"+^ — 1 + 2fc] . 
Denoting k ^ j + 1, the inequality £ < 2"+^ gives < j < 2", which contradicts 
Lemma [H D 

An infinite word z is called everywhere Abelian k-repetitive if there exists an 
integer n > 1 such that every subword of length n has a prefix that is an Abelian 
fc-power. All Sturmian words, for example, are everywhere Abelian /c-repetitive 
for all integers k [12]. The first item in the next result proves a conjecture by 
the third author [15], and the second item shows that Abelian cubes need not 
occur in every position even in a uniformly recurrent binary word with bounded 
Abelian complexity. 

Corollary 8. The following three statements hold: 

(i) The Thue-Morse word TMq is not Abelian 3-repetitive. 

(ii) The words OTMq and ITMq do not have an Abelian cube as a prefix. 

(iii) All binary overlap-free infinite words are Abelian 2-repetitive but not 
Abelian 3-repetitive. 

Proof. Lenima[7]says that the length of the shortest Abelian 3-power occurring 
at positions of the form 2" — 1 grows arbitrarily large. Therefore the Thue-Morse 
word is not Abelian 3-repetitive, and ^ is proved. 

The second item follows from Lemma [7] since the word a/x"(l) occurs at 
position 2" — 1 where a is or 1 depending on the parity of n. Note that TMg 
is obtained from lim„_j.oo m(1) by exchanging Os and Is. 

Let us then prove item fuil) . Firstly, it is readily verified that every binary 
overlap-free word of length at least 10 has a prefix that is an Abelian square. 
Secondly, by Lemma 3 in AUouche et al. 2 , if an infinite word x is overlap-free, 
there exist a finite word p G {e, 0, 1, 00, 11} and an overlap-free infinite word 
y such that x = p/i(y). From this it follows that x contains the words /x"'(0) 
for all n > 1. In other words, every subword of the Thue-Morse word is in x, 
and consequently there is no number £ such that every position in x has an 
occurrence of an Abelian cube with period at most £. D 



3 Avoiding Abelian squares in the beginning 

In this section, we improve the second item of Corollary [8] We refer the reader 
to Chapter 10 of Lothaire |2 for the basic notions and properties of morphisms 
left undefined here. 

Theorem 9. There exists a uniformly recurrent infinite word with bounded 
Abelian complexity that does not begin in an Abelian square. 



Proof. Consider the fixed point z of the morphism g: i— )• 0111110, 1 1— ?• OHIO. 

•22^ 
.5 3. 



The incidence matrix of 5 is (| § ) with eigenvalues 



Ai = -(S + VIl) = 5.70... and Ai = -(5 - VIl) = -0.70 . . . . 

Therefore g is a Pisot morphism, and by a result of Adamczewski [1] , the word 
z is C-balanced for some C. Thus according to Lemma [1] it also has bounded 
Abelian complexity. Since g is a, primitive morphism, z is also uniformly recur- 
rent (see also Remark [T7|) . Therefore we only need to show that z does not have 
a prefix that is an Abelian square. Suppose that uv is a prefix of z with u and 
V Abelian equivalent. Since \uv\q is even, we see that uv = h{w) for some word 
w. Furthermore, since \uv\i is even, it follows that \w\ is even, so that there 
exist wi and W2 such that w = W1W2 and |wi| = \w2\- Since |3(0)|o = 2 and 
\g{l)\o = 2, we have |ff(wi)|o = \g{w2)\o, and thus u = g{wi) and v = 5(^2)- 
Furthermore, since |(7(0)|i > |(7(l)|i it follows that the number of Os and Is in 
the words wi and W2 is the same; in other words they are Abelian equivalent. 
Therefore z has a prefix that is shorter than uv and an Abelian square. The 
claim now follows by induction. D 

Remark 10. The infinite word z constructed in the proof of the previous the- 
orem avoids Abelian squares only in the beginning: There is an Abelian square 
in every position except in first one. 

4 Morphisms forcing bounded AbeUan complex- 
ity 

It is an immediate fact (see also Theorem |3]) that the Thue-Morse morphism 
maps all words to words with bounded Abelian complexity. We characterize 
the class of all morphisms sharing this property. This result will be useful in 
Section [5] for providing an example of a word having infinitely many positions 
without Abelian squares. 



For a vector v = {vi, . . . , Vk) G N , we denote \\v\\ — J2 



"' ■nTi~< /^ r^t-i /~t-i- 1~< /] 1 \ 



V, 



Theorem 11. A morphism f : A —i' B maps all words to words with bounded 
Abelian complexity if and only if there exists v G N'^^ such that, for each letter 
a E A, there exists an integer Ka such that '${f{a)) — KaV. 



Proof. Suppose that /(w) has bounded AbeUan complexity for all infinite words 
w. Consider two different letters, say and 1, and denote k = |/(0)|, i = |/(1)|- 
Assume first that *(/(00) ^ *(/(!'')) and observe that |/(00| = |/(l'')| (= 
1/(0)11/(1)1). Now observe that lim„^oo||*(/(0"^)) - *(/(l"'=))|| = oo. So the 
word w = /(ni>o O'l*) is not C-balanced for any integer C, and therefore by 
Lemma[Tl the Abelian complexity of w is not bounded. Thus if the morphism / 
maps any word to a word with bounded Abelian complexity, then for all letters 
a,6 e A we must have ^(/(al^(^)l)) = *(/(6l/(")l)), that is, |/(6)|^(/(a)) = 
|/(a)|\l/(/(6)). Now for each letter a & A, let Ka denote the gcd of the entries 
of $(/(a)); then we have ^(/(a)) — KaVa for some vector Va G N^^. Then 
|/(6)|*(/(a)) = |/(a)|*(/(6)) implies that Va = vt, for all letters a, b. 

Conversely, suppose that there exists v G N*^ such that for all letters a € A, 
there exists an integer Ka such that ^^(/(a)) = KaV. If \\v\\ — 0, then /(w) 
is empty for all words w, and the claim trivially holds. So let us assume that 

ll^ll^o. 

Let w be an infinite word, and denote M = max{ |/(a)| : a € A}. If w is 
a subword of w longer than M, then it is of the form sf{u)p with s, u,p three 
words such that s is a suffix of /(a) for a letter a, p is as prefix of /(6) for a 
letter b. Observe that 



*(w) = *(/(w)) + *(sp) = 



.a£A 



v + '${sp). 



Denoting C„ = J2a£A\'^\aKa, we thus have \w\ = C„||w|| + ||*(sp)||. Then 



ii^ii ii^ii 

and consequently, 

\w\-2M ^ ^, ^ Iwl „,^-. 

\\n \M 

where the inequality means coordinate- wise inequality and 1 = (1,1,...,!). It 
follows that the word /(w) has bounded Abelian complexity. D 

5 Morphisms preserve bounded Abelian com- 
plexity 

Having shown in the previous section that the image of an infinite word w under 
a morphism can have unbounded Abelian complexity, we show that this cannot 
happen when the word w itself has bounded Abelian complexity. 

Theorem 12. Let A and B be two alphabets, let w be an infinite word over 
A*, and let f : A* ^^ B* he a, morphism. If the Abelian complexity of w is 
bounded, then the Abelian complexity of /(w) is bounded. 



Remark 13. No restrictions are imposed on the morphism /. In particular, it 
could be erasing. In this case the word /(w) may be finite, but then its Abelian 
complexity is trivially bounded. 

Proof of Theorem \12l The claim is trivially true if /(w) is finite, so let us as- 
sume that /(w) is infinite. By assumption there exists an integer K such that 
Pw('^) — ^ fo'" ^11 n > 1. It follows from Lemma [T] that w is C-balanced for 
some integer C > 1. Let us denote 

ifi = C^|/(a)|, 

M = max{ |/(a)| : a€ A}, 

K2 = max{ \y\ : y is a subword of w and \f{y)\ < Ki + M }, 

K:i = K{M#Af{K2 + l). 

Since /(w) is infinite, we have Ki ^ and M ^ 0. We need to prove that 
K2 exists. Clearly, the set defining K2 is not empty. Let a be a letter that 
occurs infinitely often in w and such that /(a) 7^ e. Let m be the minimal 
length of subwords of w containing at least C + 1 occurrences of a. Since w is 
C-balanced, any subword of w of length m contains at least one occurrence of 
a, and so any subword of length m{Ki -|- M -I- 1) contains at least Ki -I- M -I- 1 
occurrences of the letter a: for any such subword y, |/(y)| > -ftTi -I- M. Thus the 
set defining K2 is finite and therefore the number K2 exists. 

We will show that pf(^){n) < K^ for all n > M; this implies that the Abelian 
complexity of /(w) is bounded by ma,x{K3, pf^^.^{0), ..., pf(^){M - 1)}. 

For any integer n > 0, let us denote by Tn the set of triplets {s,u,p) of 
words u £ A* and s,p G B* such that there exist two letters a and /3 with 
au/S a subword of w, s a proper suffix of /(a), p a proper prefix of /(/3) and 
\sf{u)p\ = n. Observe that any subword of /(w) of length n > M can be 
decomposed sf{u)p with {s,u,p) e T,i- Denote also 5„ — {m : {s,u,p) € 
Tn, \u\ = to}. 
Claim. For all n > 0, we have #5„ < K2 + 1. 

Proof of the claim. Assume that to,i, m2 G 5„ with mi > m2, and let (si, ui,pi) 
and {s2tU2tP2) be two triplets in Tn such that |iii| — mi and \u2\ = to,2. We 
decompose ui into ui = xy with |a;| = TO2. By definition of 7^, words ui and 
U2 are subwords of w (and so x is also a subword of w) . Therefore since w is 
C-balanced, we have |u2|a < |a;|a + C for all letters a £ A. Thus 

1/(^2)1 = El"2|a|/(a)| < Y.^\x\a + C)\f{a)\ = \f{x)\+Ki. 

As a consequence of the definition oi Tn, we have |/(u2)| ^ n ~ IS2P2I and 
|/(a;)| = n- \f{y)\ - \sipi\. Thus |/(y)| + |sipi| - IS2P2I < Ki which implies 
\fiy)\ < Ki+M and so, by definition of i^2, \y\ < ^2- Hence < toi — TO2 < K2 
and elements of 5„ can take at most K2 + 1 different values. This ends the proof 
of the claim. D 



We continue the proof of Theorem [T^l As noted before, if n > Af , then 
p'f(^\{n) equals the number of Parikh vectors of words of the form sf{u)p 
with {s,u,p) G Tn- Now observe that if {s,ui,p) and {s,U2,p) are in %i 
with ^{ui) — ^(^2), then ^(s/(ui)p) = 'i{sf{u2)p)- Therefore the quan- 
tity p°jf',^Jn) is bounded by the number of triplets of the form (s, '^{u),p) with 
(s, u,p) € Tn- By the previous claim, we know that a word u such that a triplet 
of the form (., u, .) is in Tn can take at most K2 + 1 different lengths. Thus by 
hypothesis, we get at most K{K2 + 1) different possible vectors 'i'{u). Moreover 
there are at most M^A possibilities for s and at most M^A possibilities for 
p. Hence the cardinality of {(s, '^{u),p) : {s,u,p) £ 7^i} is bounded by K3, and 
the Abelian complexity of /(w) is therefore bounded. D 

Remark 14. The converse of Theorem [12] does not hold; in fact, by Theorem[3l 
the Abelian complexity of /x(w) is bounded for any binary word w. 

6 Avoiding Abelian squares in infinitely many 
positions 

Theorem 15. There exists a uniformly recurrent infinite word with bounded 
Abelian complexity in which there are infinitely many positions where no Abelian 
square occur. 

To prove this theorem, we first state an important property of the uniform 
morphism /: {0, 1}* — !• {0, 1}* defined by 

/(O) = 00011 and /(I) = 01100. 

Lemma 16. Let w be an infinite binary word, and suppose that /(w) begins 
in a word of the form OOOluu, where u and v are nonempty Abelian equivalent 
words. Then w has a prefix of the form OxOyO for some words x and y with 

N = M- 

Proof. We divide the proof into five cases depending on the remainder of |w| 
mod 5; 

1. We have |m| = (mod 5). Then, since \u\ ^ 0, the word u is of the 
form l/(a;)0001 or l/(a;)0110. In the first case v must be of the form 
lf{y)0001 or l/(y)0110 with \x\ = \y\; in the second case v is of the form 
0/(?/)0001 of 0/(y)0110. In both cases, |a;| = \y\ so f{x) and /(y) are 
Abelian equivalent. Since u and v are Abelian equivalent, it follows that 
the only possible case is u = l/(a;)0001 and v — lf{y)0001. Then w has 
prefix OxOyO, and the claim holds. 

2. We have |m| = 1 (mod 5). Then u is of the form u = l/(x) and v is of the 
form V = /(y)0 for some words x and y. This is, however, a contradiction 
because u and v are Abelian equivalent. 
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3. We have |u| = 2 (mod 5). Then u is of the form u = lf{x)0 and v has 
one of the foUowing four forms: 

0011/(2;)000, 0011/(y)011, 1100/(y)000, 1100/(z/)011. 

But regardless of which of these forms v may have, the words u and v are 
not Abehan equivalent, a contradiction. 

4. We have \u\ = 3 (mod 5). Then either u is of the form l/(a;)00 and v is of 
the form Oil f{y), or u is of the form l/(a;)01 and v is of the form 100 f{y). 
But again we quickly verify that u and v cannot be Abelian equivalent in 
either situation, a contradiction. 

5. We have |u| = 4 (mod 5). We separate two possibilities: Either u = 
l/(a;)000 or u = l/(a;)011. In the first situation, we have either v = 
llf{y)00 or ll/(y)01; in the latter situation we have either ■;; = 00/(y)00 
or V = 00f{y)01. But, as in the previous cases, we see that u and v cannot 
be Abelian equivalent. This contradiction concludes the proof. 

D 

Proof of Theorem \15\ Let h be the uniform morphism defined by h{0) — 01011111 
and h{l) — 11101111, and let w^ be the fixed point of h beginning with the let- 
ter 0. Our goal is to show that the infinite word /(w^) is a uniformly recurrent 
word avoiding Abelian squares in infinitely many positions. To achieve this, we 
need the following four steps: 

1. Let w be any binary word beginning with the letter 0, and suppose that 
the word /i(w) has a prefix of the form OxOyO with |x| = \y\. Then there 
exist words x' and y' with \x'\ = \y'\ such that Da; = h{Ox'), Oy = h{Qy'), 
and Ox'Oy'O is a prefix of w. 

Proof. Let the letters in h{w) be indexed starting from 1. Since \h{0)\ = 
\h{l)\ — 8, the way the letter occurs in h{0) and h{l) implies that 
can occur in h{w) only at positions congruent to 1, 3, or 4 (mod 8). 
Therefore |0a::| is congruent to 0, 2, or 3 (mod 8); accordingly |0x0j/0| 
must be congruent to 1, 5, or 7 (mod 8), respectively. Only the first case 
is possible, and thus we have Oa; = h{Ox'), Oy — h{Oy') for some x', y' with 
|a;'| — \y'\ such that Ox'Oy'O is a prefix of w. D 

2. The word w^ does not have a prefix of the form OxOyO with \x\ — \y\. 
Proof. Immediate consequence of the first step. D 

3. The word w/,, does not have a prefix of the form OlOxOj/0 with |a:;| = \y\. 
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Proof. Suppose that OlOxOj/O with |a;| = \y\ is a prefix of w/j. Since the 
letter occurs only in positions congruent to 1, 3, or 4 (mod 8), we 
readily check that |0a;| = \0y\ = (mod 8). Hence the fact that 010a;0 
is a prefix of Wh implies that 010x0 = h{0x')01Q for some x' . Further, 
since OlOxOyO is a prefix of w/i and OlOxOyO = h{Ox')010yO, it follows that 
OlOyO = ft,(0y')010 for some y' . Consequently we have, 

010a;OyO = h{Ox')010yO = h{0x')h{0y')010, 

and so Wh has prefix h{0x'0y'0) and thus a prefix Ox'Oy'Q, where \x'\ = \y'\. 
This contradicts the previous step. D 

4. The word w;i does not have a prefix of the form /i"(01)0a;0j/0 with |a;| = \y\. 

Proof. Suppose that Wh does have prefix h"{01)QxOyO with \x\ = |?;|. By 
the previous step, we have n > 1. Denote "w^ = Olz. Since h"'{'Wh) = y^h, 
we see that /i"(z) has prefix OccOyO, and so by the first step, the word 
/i"^^(z) has prefix Ox'Oy'O for some x',y' with \x'\ — \y'\. Consequently 
w^ has a prefix of the form h^~^{Ql)Qx'Oy'Q, which is a contradiction by 
induction. D 

Finally we are ready to wrap up the proof of Theorem 1151 The word w^ 
is uniformly recurrent since it is a fixed point of a primitive morphism. Thus 
/(w^) is uniformly recurrent as well. Its Abelian complexity is bounded by 
Theorem [TTl For n > 0, let w„ be the word determined by w/j = /i"(01)w„. By 
the previous step, w„ does not have a prefix of the form OxOyO with |a;| = \y\. 
According to Lemma [161 the word /(w„) does not have a prefix of the form 
OQOluv with u and v Abelian equivalent. This shows that there exist infinitely 
many positions in /(w/j) in which no Abelian square occurs. D 

Remark 17. An infinite word is linearly recurrent if, for all subwords w, any 
two consecutive occurrences of w are within L\w\ positions, where L > is 
constant. The infinite words constructed in the proofs of Theorems [9] and [15] 
are not only uniformly, but even linearly recurrent. In fact, F. Durand [5J showed 
that every fixed point of a primitive morphism is linearly recurrent (see also [6] 
for a precise statement of this). 

7 The general case 

The set of positions in the infinite word /("Wh) given in the proof of Theorem ll5l 
in which no Abelian squares occurs has density 0. This suggests the following 
question: Does there exist an infinite word with bounded Abelian complexity 
avoiding Abelian cubes in a set of positions with positive density? The answer 
is no. Indeed, by using Szmeredi's theorem ^17; instead of van der Waerden's 
theorem in the proof of Theorem 5.1 in J12j . we get the following result: 
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Theorem 18. Let w be an infinite word with bounded AbeUan complexity, 
and let £) C N be a set of natural numbers with positive upper density, that is 

Dn{l,2,...,n} ^ ^ 
limsup > 0. 

n— ^oo 1^ 

Then, for every integer fc > 2, there exists an integer i £ D such that there is 
an Abelian fc-power occurring at position i in w. 

This result naturally leads to the following question. 

Open problem 1. Does there exist an infinite binary word that avoids Abelian 
squares in positions with positive upper density? 

Let us consider a slightly modified version of this question. We say that 
an infinite binary word w satisfies BAS property if the set of positions in w 
that avoid Abelian squares have positive lower density. This means that such 
positions in w occur in bounded gaps. In what follows we show that the existence 
of a word with BAS property is equivalent to the following well-known question 
asked independently by Pirillo-Varricchio [TT] and Halbeisen-Hungerbiihler [5] : 

Does there exist an infinite word over a finite set of integers such that 
no two consecutive blocks of the same length have the same sum? 

We say that an infinite word satisfying the above condition has the PVHH 
property. 

Theorem 19. There exists an infinite word satisfying the PVHH property if 
and only if there exist an infinite binary word satisfying the BAS property. 

We prove the claim in the next five lemmas. If x is a finite word whose 
letters are integers, let ^ x denote the sum of the letters in x. 

Lemma 20. If there exists a word satisfying the PVHH property, then there 
exists an infinite binary word satisfying the BAS property. 

Proof. Suppose that an infinite word x = xia;2 . . . satisfies the PVHH property. 
Suppose that x is over an alphabet A = {ai, 02, . . . , a^}, where each a^ is an 
integer. Since the set of words satisfying the PVHH property is clearly closed 
under affine transformations, we may assume that each a, is an odd positive 
integer. Define a morphism r by T{ai) = 01"'0. We claim that the word t(x) 
has the BAS property. We prove this by showing that, for all i = 1, 2, . . ., the 
word z := T{xiXi+i ■ ■ ■) does not have an Abelian square as a prefix. 

Suppose the contrary; a word uv is a prefix of z with u and v Abelian 
equivalent. We proceed by the following reasoning analogous to what we used 
in the proof of Theorem[9l Since \uv\q is even, the form of the T[ai) implies that 
uv = t{w) for some word w d A. Similarly, since \uv\i is even, it follows that 
\w\ is even, and we can write w = W1W2 with \wi\ — \w2\. Since |T(ai)|o — 2 for 
all i, we have |r(i(;i)|o = |''"(w2)|o; and thus u — t{wi) and v — t{w2). But now. 



Hi ^^a^\wi\ai ^^wi and \v\i ^^ai\w2\a, ^^ 



W2 
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which contradicts the PVHH property because |u|i = |w|i. D 

We begin proving the converse by defining the following word. Let M be a 
positive integer, and let v = wqWi ... be the periodic sequence given by 

_ J 2 " ■"°'i ^'^ if n ^ - 1 (mod M) 
^"~ [1-2^^-1 otherwise. 

Lemma 21. Let x, y be two factors of v. If ^ x = ^ y, then |a;| = \y\ mod M. 

Proof. Observe first that J2l7=o ^n = 2* '^°^ ^^ — 1. Thus if x occurs at position 
i and has length j - i, then '£x = 2^ '^°'^ *^ - 2' '^°'^ *^. If ^ x = 0, then \x\ = 
(mod M). Otherwise, the congruence classes mod M of i and j, hence of |a;|, 
can be recovered from '^x. □ 

Lemma 22. Let u = uqui ... be any infinite binary word, and define a word 
w = wqwi ... by 

Wn^Un- 2*^ +Vn- 

If xy is a factor of w, say x = WiWi+i ■ ■ ■ Wj-i and y — WjWj^i ■ ■ ■ Wk-i, with 
^x = ^y, then |a::| = |y| (mod M) and 

j-i fe-1 

^Uh = ^Uh. (1) 

h—i h—j 

Proof. First off, we have 

j-i j-i fc-i fe-i 

h=i h=i h=j h—j 

Since ^x — ^y^ this gives 

h—i h—j h—i h—j 

The inequahty 

j — 1 k — l 7~1 k—1 

\Y.v, ~Y.v>^\ < \Y,v,\ + \j2v,\ < 2 ■ (2*^-1 - 1) < 2*^ 

h—i h—j h—i h^j 

thus implies that 

7-1 fe-l J-l fe-l 

{y^^h-^Uhj =0 and (E "'» ~ E "'» ) ^ *^- 

h—i h—j h^i h^j 

The first equation gives ([l} , and the latter equation with Lemma [21] implies 
that l^l = \y\ (mod M). D 
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Lemma 23. Let iV be a positive integer. Suppose that u is a binary word 
with positions no, ni,n2, ■ ■ ■ such that iN < rii < (i + 1)N and each position Ui 
avoids Abehan squares. Define a new word z by 

Zi = Wn, + Wn,+1 H h Wm+^-l, 

where w is the word defined in the previous lemma with M — 2N. Then z 
satisfies the PVHH property. 

Proof. Suppose that xy is a factor of w such that X) ^ = S J^ ^^'^ 1^1 = I?/ 1- We 
have 

ZiZi+i ■ ■ ■ Zi+i-i and y = Zi+iZi+e+i ■ ■ ■ Zi+2e-i 

for some £ > 1. Let x' and y' denote the corresponding decoded words in w, 
that is to say, let 

X' = Wn,Wni + l ■ ■ ■ Wn,^i-l and y' = Wni^fWn,+^ + l ■ ■ ■ W^.^^^^-l. 

Since 'Y^x' ='Y^y' , the previous lemma says that \x'\ = \y'\ (mod 2iV) and 

^ Mft = ^ Uh. (2) 

h—rii h—rii^i 

On the other hand, |a;'| = rii+i — Ui and \y'\ — ni+21 — ni+i, so that 

-2N < \y'\ - \x'\ < 2N. 

Therefore |a;'| = \y'\. But this and Eq. ^ means that there is an Abelian square 
of length \x'y'\ in u occurring at position Ui, a contradiction. D 

Lemma 24. If there exists an infinite binary word satisfying the BAS property, 
then there exists an infinite word satisfying the PVHH property. 

Proof. If u is an infinite binary word with the BAS property, then it satisfies 
conditions of the previous lemma for some integer TV, and therefore a word 
satisfying the PVHH property exists. D 
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